Including Language Model Information in the Combination of Handwritten Text Line Recognisers

نویسندگان

  • Roman Bertolami
  • Horst Bunke
چکیده

This paper proposes a novel language model based combination method for ensembles of offline handwritten text line recognisers. The individual recognisers are based on hidden Markov models and the ensembles are generated with the bagging method. The proposed combination method extends the ROVER framework by rescoring the word transition networks with a language model. Experiments conducted on a large database of offline handwritten text lines show that the proposed approach can improve the recognition accuracy over a reference system as well as over the original ROVER combination method.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ensemble methods for offline handwritten text line recognition

This thesis investigates ensemble methods for offline recognition of English handwritten text lines. Multiple recognisers are automatically generated from a single base recognition system. Combining the output of these multiple recognisers provides the final ensemble result. The underlying recognisers are based on hidden Markov models. One model is built for each character. Based on the lexicon...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Combination of Multiple Handwritten Text Line Recognition Systems with a Recursive Approach

In this paper we propose a novel method to combine the results of multiple text line recognition systems. The method uses a recursive approach and re-examines those parts in a text line which have been rejected based on the initial combination of the base recognisers’ results. By means of the new method, the search space can be reduced, and therefore more accurate recognition results can be exp...

متن کامل

Character-Based Handwritten Text Recognition of Multilingual Documents

An effective approach to transcribe handwritten text documents is to follow a sequential interactive approach. During the supervision phase, user corrections are incorporated into the system through an ongoing retraining process. In the case of multilingual documents with a high percentage of out-of-vocabulary (OOV) words, two principal issues arise. On the one hand, a minor yet important matte...

متن کامل

HMM-Based On-Line Recognition of Handwritten Whiteboard Notes

In this paper we present an on-line recognition system for handwritten texts acquired from a whiteboard. This input modality has received relatively little attention in the handwriting recognition community in the past. The system proposed in this paper uses state-of-the-art normalization and feature extraction strategies to transform a handwritten text line into a sequence of feature vectors. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008